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Abstract. According to a theorem of Courcelle monadic second-order logic and guarded 
second-order logic (where one can also quantify over sets of edges) have the same expressive 
power over the class of all countable fc-sparse hypergraphs. In the first part of the present 
paper we extend this result to hypergraphs of arbitrary cardinality. In the second part, we 
present a generalisation dealing with methods to encode sets of vertices by single vertices. 



INTRODUCTION 

Guarded second-order logic (GSO) is the variant of monadic second-order logic (MSO) where 
one can not only quantify over sets of vertices but also over sets of edges. This modification 
results in a large increase of expressive power. Statements that can be expressed in guarded 
second-order logic, but not in monadic second-order logic, include the existence of certain 
minors in a graph and the existence of Hamiltonian paths. 

The high expressive power of guarded second-order logic means that most GSO-theories 
are quite complicated. In [7] Seese has shown that every class of graphs with infinite tree 
width has an undecidable GSO-theory. This result immediately generalises to hypergraphs. 
It follows that all classes of hypergraphs with a decidable GSO-theory are k-sparse, for 
some k, which roughly means that their members have few edges. For classes of countable 
A;-sparse hypergraphs, Courcelle [2] has shown that every GSO-formula is equivalent to an 
MSO-formula over such a class. It follows that over every class of countable hypergraphs 
with a decidable GSO-theory guarded second-order logic and monadic second-order logic 
have the same expressive power. Unfortunately, the proof of Theorem 1.4 in [2] contains an 
error. In the first part of the present article we give a new proof of this theorem. In addition, 
we extend the result from countable hypergraphs to hypergraphs of arbitrary cardinality. 

When we look at the results of the first part we see that most of them concern the 
coding of sets of vertices by single vertices. In the abstract, this problem can be stated as 
follows: given a set F C £P(V) of finite sets of vertices, find a definable function / : F — > V 
that is injective. In our concrete case, F := E is the set of edges. In the second part of the 
paper we consider more general instances of this problem where F can be arbitrary. This 
generalisation is inspired by a result of Colcombet and Loding [1] on set interpretations. Their 
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main technical result is a method to transform a definable finite-to-one function F — > V into 
an injective one. Colcombet and Lodung consider as background structure only the infinite 
binary tree. Below we show that using guarded second-order parameters, i.e., sets of edges, 
we can extend some of their results to arbitrary graphs. 

The overview of the article is a follows. We start in Section Q] with basic definitions 
and a survey of results on definable orientations of sparse hypergraphs. In Section [2] we 
prove the general version of the one technical result of [2j whose proof does not extend to 
arbitrary cardinalities. In Section [3] we summarise the consequences for the expressive power 
of guarded second-order logic on sparse hypergraphs. 

Section [¥] contains the second part of the article. We study network flow problems and 
we show how to use flows to transform definable finite-to-one maps into injective ones. 

1. Orientations of sparse hypergraphs 

Let us fix our terminology regarding graphs and hypergraphs. When we say 'graph' we will 
mean an undirected one. Undirected graphs will always be simple and loop free, whereas 
directed graphs will be simple, but they may contain loops. When dealing with hypergraphs 
we will sometimes allow multiple edges. Such a hypergraph is a two-sorted structure (U, E, I) 
where V is the set of vertices, E the set of edges, and I C V x E the incidence relation. 
Using sloppy notation we will tacitly identify an edge e E E of such a hypergraph with the 
set { v € V | (v,e) € / } of its vertices and we write v E e instead of (v, e) 6 /. Similarly, if 
F C E is a set of edges then the union (J F consists of all vertices incident with at least one 
edge of F. We will use this notation even if there are multiple edges. 

Monadic second-order logic (MSO) extends first-order logic by variables and quantifiers 
that range over sets of vertices. Similarly, guarded second-order logic (GSO) extends first- 
order logic by variables and quantifiers ranging over sets of vertices or sets of edges (for 
detailed definitions see [1]). We will also consider weak monadic second-order logic (WMSO) 
where quantification is restricted to finite sets of vertices. 

Definition 1.1. Let fj = (V,E) be a hypergraph. 

(a) We say that S) has rank m if every edge of f) has at most m vertices. 

(b) A subhypergraph of S) is a hypergraph S)q = (Vb, Eq) with Vq Q V and Eq C E. 

(c) Let CCU. The subhypergraph induced by C is 

Si\ c ■= (C,E\ C ) with E\ c :={ee£|eCC}. 

In order to translate GSO-formulae into MSO-formulae, we have to encode sets of edges 
by sets of vertices. A simple way to do so consists in choosing an orientation of the hyper- 
graph, i.e., a function assigning to each edge one of its vertices. 

Definition 1.2. Let fj = (V, E) be a hypergraph. 

(a) An orientatior$\ of S) is a function / : E — > V with /(e) £ e, for all e € E. We say that 
a formula <p(x, Y) defines an orientation / of Sj if we have 

fj |= ip (a, e) iff /(e) = a , for all a € A and e € E . 

(b) An orientation / is bounded by k if 

|/ -1 (a)|<A;, for all a e A. 



This is called a semi- orientation in [2]. 
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(c) We call f) MSO -orientable if there exist an MSO-formula ip(x, Y; P) with parameters 
Pi defining an orientation of $). Similarly, we say that fj is GSO -orientable if there 
exist a GSO-formula ip(x, Y ; P, Q) with parameters Pj C V and Qi C E defining an 
orientation of fj. 

In this paper we are mainly interested in sparse hypergraphs, i.e., hypergraphs with few 
edges. 

Definition 1.3. A hypergraph f) = (V,E) is /c-sparsd3 if 

l-E'lx) < k ■ \X\ , for every finite set X C V . 

Lemma 1.4. Let (5 = (V, E) be a graph. 

(a) If the degree of & is at most 2k, then <3 is k-sparse. 

(b) // & is planar, then it is 3-sparse. 

Proof. 

(a) If X = (X, F) is a finite induced subgraph of (5 then 

2-\F\ = ^deg(Tj) < 2k-\X\. 

(b) This follows from the fact that every planar graph with n vertices has at most 3re — 6 
edges (see, e.g., Corollary 4.2.10 of [3]). □ 

In the next section we will prove that every hypergraph of bounded rank is GSO-orientable. 
In the remainder of this section we show that fc-sparse hypergraphs are even MSO-orientable. 
For countable hypergraphs these results are all due to Courcelle [2]. The only thing new in 
the present section are two applications of the compactness theorem for first-order logic to 
extend the results to uncountable hypergraphs. The proofs in Section [21 on the other hand, 
are mostly new. 

Lemma 1.5. A hypergraph S) = (V,E) (possibly with multiple edges) of finite rank is k- 
sparse if and only if there exists an orientation of Sj that is bounded by k. 

Proof. For («=), let X C V be finite. Then 

\E\ X \ < X)|/ _1 (a)| <*-|*l- 

(=>) First, let us consider the case where Sj is finite. If / is an arbitrary orientation of S) then 

Y}r\a)\ = \E\<k-\V\. 

a&V 

Hence, if there is some element a £ V with |/ _1 (a)| > k then there must be some other 
element b E V with < k. Let us define the weight of an orientation by 

Hf) ■= E { l/~ X ( a )l ~k\aeV, \r\a)\ >k). 
We have to construct an orientation of weight 0. To do so we transform an orientation / with 
w(f) > into one with smaller weight. Given /, fix an element a € V with \f~ 1 (a)\ > k. 
Let FC£be the smallest subset of E such that a belongs to the set U := (J F and we have 

In [2| such hypergraphs are called uniformly k-sparse. Courcelle also introduces a notion of a k-sparse 
graph. Since uniform sparsity is the more robust notion, and the only one we will use in this paper, we have 
changed terminology for brevity. A related notion is the arboricity of a graph (see, e.g., Section 2.4 of [5]). 
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/ (c) ^ F, for every element c 6 (/. The subhypergraph fj|;y induced by U is /c-sparse. 
Hence, there exists some element b G U with |/ _1 (fr)| < k. By choice of F we can find a 
sequence of edges eo, . . . , e n £ F with 

b G e , /(ej) G e i+ i , and f(e n ) = a . 

We define a new orientation g by setting 

b if e = eo , 

/(ei_i) if e = e$, i > 0, 
/(e) otherwise. 



5(e) := 
It follows that 

|0 _1 O«O| 



f|/-i(a)|-l ifx = a, 
irH^I + l ifx = 6, 
|/ _1 0c)| otherwise. 

Hence, tu(<?) < w(f). Repeating this construction we obtain an orientation / with w(f) = 0. 

The general case where f) may be infinite can be proved using the compactness theorem 
for first-order logic. Let A be the elementary diagram of f) (i.e., the set of all first-order 
formulae with parameters that hold in S) ; see [5] for details) where we consider $j as a two- 
sorted structure (V, E, I) with a binary incidence relation /. We can write down a formula <p 
stating that / : E — > V is a function such that 

• (/(e), e) <E I , for all e G E , 

• |/ _1 (a)| < k, for all a G V . 

By assumption and the first part of the proof, every finite subset of A U {</?} is satisfiable. 
Therefore, according to the compactness theorem, there exists a model fj + = (V + , E + , I + , f + ) 
of A U {<£>}. By the Diagram Lemma (see, e.g., [5]), we can find an elementary embedding 
h : 9) — > fj + (i.e., an embedding preserving every first-order formula). Since every edge of 9) 
has only finitely many vertices it follows that 

(a, h(e)) G I + implies a = h(v) , for some ti£e. 
Hence, we can define the desired orientation of fj by / := h^ 1 o /+ o h. □ 

It turns out that the orientation obtained via the preceding lemma is MSO-definable. 
The following sequence of lemmas shows how we can encode such an orientation by a finite 
set of unary predicates. 

Definition 1.6. Let 9) = (V,E) be a directed graph and (5 an undirected one. 

(a) Every orientation / of & induces an directed graph ©/by orienting every edge e of & 
such that it points to the vertex /(e). 

(b) An $)- orientation of (3 consists of a pair (/, h) where / is an orientation of (5 and h is 
a homomorphism (8f — > S). 

We say that an fj-orientation (/, h) is bounded by k if / is bounded by k. 

(c) We say that a family (P v ) v ev °f unary predicates encodes an i>orientation (f,h) of & 
if P v = h^iv), for all v G V. 

Lemma 1.7. For every finite graph f), there exists a first- order formula (p^{X) such that 
<S \= <ffi(P) iff the tuple P encodes an $)- orientation of (5 . 
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Proof. Let uq, . . . , u n -\ be an enumeration of the vertices of Sj. All ip^^X) has to say is that 
the Xi form a partition of the vertices (some Xi may be empty) and that there is no edge 
{v, w} of (3 such that v £ Xi, w £ X^ and (m, u^) is not an edge of fj. □ 

Theorem 1.8 (Nesetfil, Sopena, Vignal |6j). For every k < oj, there exists a finite loop- 
free directed graph with antisymmetric edge relation that has the following property. For 
every finite directed graph (&, with irreflexive and antisymetric edge relation and indegree at 
most k, there exists a homomorphism & — > T^. 

Corollary 1.9. Every k-sparse undirected graph has a T^- orientation which is bounded by k. 

Proof. In Lemma ll.5[ we have shown that such a graph (3 = (V, E) has an orientation 
/ : E —> V that is bounded by k. It follows that &f has indegree at most k. By the theorem, 
there exists a homomorphism h : <3f —> %k- Thus, (/, h) is the desired T^-orientation. O 

Lemma 1.10. For every k <uj, there exists a first- order formula n^^X) such that 

© \= r/k(P) iff P encodes a Tfc- orientation of (5 that is bounded by k . 

Proof. Note that the homomorphism h of a Tfc-orientation (/, h) uniquely determines the 
orientation / since the edge relation of is antisymmetric. In particular, the parameters P 
encoding (/, h) tell us whether / is bounded by k. Hence, we can obtain rjk(X) by adding a 
check for boundedness to the formula (X) of Lemma [1,71 □ 

Corollary 1.11. The class of all k-sparse undirected graphs is finitely MSO-axiomatisable. 

Proof. By Lemma 11.51 and Corollary 11.91 it follows that a graph (5 is A;-sparse if and only if it 
has a Tfc-orientation that is bounded by k. Hence, we can use the formula z\Xr]k(X) where 
r/k is the formula from Lemma [1.101 □ 

In order to apply these results to hypergraphs we use the following construction associ- 
ating a graph with every hypergraph. 

Definition 1.12. Let fj = (V, E) be a hypergraph with orientation /. We define a directed 
graph Of{SS) ■= (V,F) with edge relation 

F := { (a, b) \ a 7^ b and there is some edge e € E with a € e and /(e) = b } . 

Lemma 1.13. Let S) = (V,E) be a k-sparse hypergraph of rank m where < k < oj and 
1 < m < oj. Then fj has an orientation f that is bounded by mk 2 such that the edge relation 
ofOf{^f) is antisymmetric. 

Proof. First, we consider the case that is finite. We call an element a £ V bad for an 
orientation / of Sj if there is some element b G V such that Of (fj) contains both edges (a, b) 
and (b,a). Note that this implies that the vertex b is also bad. 
We construct a sequence of orientations (f n ) n such that 



\fnHa)\ < 




if a is bad for /„ 
otherwise , 



and the number of bad elements decreases at every step. We start with an arbitrary orien- 
tation /o bounded by k. 

Given an orientation /„, with the above properties we construct a new orientation / n +i 
with fewer bad elements as follows. Let a be a bad element, set X := f~ 1 (a), and let 

Y := { e I a £ e and f n (e) £ [j X \ {a} } . 
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Since a is bad we have 

|X|<Jfe and \\JX\ < k(m- I). 

Note that every element of the form 6 := f n (e) with e € Y is also bad since, by definition 
of X, there is an edge e' G X with 

b £ e and fn(e') = a . 

Consequently, Of n (Sj) contains the edges (b, a) (since / n (e') = a) and (a, b) (since / n (e) = b). 
It follows that 

\Y\ <k- |U^\WI < k 2 (m- 1). 
We define the new orientation / n +i by 



fn+i(e) 



a if e € y , 
/ n (e) otherwise. 



Then we have 




- 1) if x = a , 
otherwise . 

In particular, f n +i is bounded by mk 2 . By construction, the element a is not bad for f n +i- 
Furthermore, if (b,c) is an edge in Of n+1 (Sj) with b,c ^ a then this edge is induced by an 
edge e in S) with e ^ 1U7. Hence, (6, c) is also an edge of Of n ($)). Therefore, every element 
that is bad for / n +i is also bad for /„. 

It remains to prove the claim for infinite hypergraphs fj. Let <£> be the union of the 
elementary diagram of S) and formulae stating that / is an orientation of fj that is bounded 
by mk 2 and that Of($j) has an antisymmetric edge relation. If 5DT is a model of ^ then there 
exists an embedding h : fj — > 50t and the desired orientation of fj can be obtained via h from 
that of SOT. Hence, it is sufficient to show that is satisfiable. Note that every finite subset 
<Po <P is satisfiable since every finite substructure of S) has an orientation of the desired 
form. By the compactness theorem it follows that 4> is satisfiable. □ 



2. Depth-first spanning trees 

While A;-sparse hypergraphs are MSO-orientable there are hypergraphs without an MSO- 
definable orientation. For instance, the countably infinite clique is such a graph. In this 
section we will show that every hypergraph of bounded rank is at least GSO-orientable. A 
basic tool the proof below is based on is the notion of a spanning tree of a hypergraph. Before 
presenting the rather involved definition for hypergraphs let us start with considering the 
simpler case of graphs. 

For a countable undirected graph (3 we can define a depth-first spanning tree to be a 
spanning tree T of <& where no edge of connects disjoint subtrees of % (see O [3] ; in [3] 
such trees are called normal). To generalise this definition to uncountable graphs we have 
to admit trees of arbitrary ordinal height. Such trees are necessarily order trees, i.e., partial 
orders (T, <) where < is a tree order, that is, a partial order such that any two elements 
have an infimum and, for every element a, the set of all elements below a is well-ordered. 
Unfortunately, we cannot in general hope to have a spanning subgraph that is an order tree, 
since the partial order < requires too many edges. Therefore, we will use a hybrid between 
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an ordinary tree and an order tree. The precise definition of a spanning tree T of a graph <3 
is as follows. Instead of requiring T to be a subgraph of (5 we consider trees T such that 

• for every vertex w of 1 with immediate predecessor v, the edge (v,w) belongs to ©, and 

• for every vertex w of T without immediate predecessor, we can fix an increasing chain 
(ui)i<a of predecessors of w with limit w and a family (iri)i <a of paths from w to Uj. 

Hence, every vertex w of 1 is attached to its predecessors via some auxiliary graph F w that 
is either a single edge or a tree with root w whose leaves form an increasing sequence of 
predecessors of w with limit w. 

Example 2.1. Consider the complete graph R K , for an uncountable cardinal k. We can 
enumerate the vertices of ^ K as (v a ) a<K where the index a ranges over all ordinals less 
than k. As depth-first spanning tree of this graph we can use a chain of length k as follows. 
We set 1 := (T,E) where 

T := { v a | a < k } 

is the set of all vertices and 

E := { (v a , v a+ i) | a < k } U { {v a , v$) \ 5 a limit ordinal and a < 5} . 

The first part of E consists of the successor edges, whereas the second part contains the 
auxiliary graphs F Vg attaching a limit vertex v$ to its predecessors. 

To generalise these ideas to hypergraphs we need a suitable replacement for the trees F w . 
Unfortunately, not every hypergraph has a spanning tree. A typical example is the hyper- 
graph 
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Instead, we will use certain tree-like hypergraphs called priority trees. 

Definition 2.2. Let Sj = (V, E) be a hypergraph. A hyperpath in f) is a sequence eo . . . e n 
of edges such that 

ei n e k / iff \i - k\ < 1 . 

If u G eo \ ei and «£e„\ e n _i then we say that the hyperpath connects u and v. 

Definition 2.3. Let 9) = (V,E) be a hypergraph of rank at most m, (T,F) be a subhyper- 
graph of S) with T = |J F, and suppose that there are partitions 

T = P U • • • U P m _i and F = F U • • • U F m _i . 

(a) Suppose that T = (T, F, L, (Fi)i <m , (Pi)i <m ,v) with L C F and v £ T. We define by 
induction when such a tuple T is a priority tree. The element w is called the root of T and 
L is its set of leaf edges. 

We start the induction with the case where F consists of a single hyperpath eo ■ ■ ■ e n with 
v € eo \ ei, we have L = {e n }, Fq = F, and Pq = T. Then T is a priority tree. We also call 
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T a priority tree if it can be obtained from a priority tree 1' = (T',F',L f , (F()i, (P()i,v) 
with the same root v by adding a hyperpath eo • • • e n such that 

et n T V iff » = , 

eo 2 T' , 

L = L' U {e n } , 

eo, . . . , e n G F fe , 

(e U • • • U e n ) \ T C P fc , 

where A; is the minimal index such that eoHP^ = 0. This is the successor case of the induction 
step. 

Finally, we also have a limit case. Suppose that 

%a = ^ Z/ Q , (i**),, (i**),, „) , for « < /3 , 

is an increasing chain of priority trees. That is, the sequences (T a ) a , (F a ) a , (L a ) a , (F") a , 
and (Pi*) a ar e all increasing, and all trees T a have the same root v. Then T is a priority 
tree if it is the union of this chain, that is, if 

T=[jT a , F=[jF a , L=[jL a , F t =[jFt, Pi = (J If. 

a</3 a</3 a</3 a</3 a</3 

(b) A branch of 1 is a hyperpath eo • ■ ■ e m C F satisfying the following conditions : 

• eo \ ei contains the root v of T. 

• Let ki be the index such that e^ G F^. We have e,+i \e; C Pk i+1 , f° r every i < m. 
Furthermore, if ki ^ ki + i then ki := min { / | ej+i n Pi ^ } . 

(c) With each priority tree 1 we associate two relations, an order < on F defined by 

e < f : iff every branch containing / also contains e, 

and an equivalence relation ~ on T defined by 

u ~ v : iff u, v G Pk , for some /c , and there exists a hyperpath 

eo • • • e m Q Fk connecting u and v . 

Example 2.4. Consider the following priority tree with edges a, b, c, d, e, / where we have 
labelled each vertex in Pi by the index i. The edge colours are given by Fq = {a, b, e}, 
Fi = {c}, F2 = {d, /}. The ordering < is displayed to the right. 



a 
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Recall that a tree order is a partial order such that any two elements have an infimum 
and, for every element a, the set of all elements below a is well-ordered. A preorder is a 
reflexive and transitive relation. Every preorder C induces an equivalence relation C n 
The equivalence classes of this relation are called tZ-classes. 

Lemma 2.5. Let T be a priority tree. The order < on the edges is a tree order. 

The proof consists of a straightforward but tedious induction following the construction 
of priority trees. 

Lemma 2.6. Let fj = (V,E) be a connected hypergraph of rank at most m. 

(a) For each vertex v G V and every set Lq C E of edges, there exists a priority tree 
X = (T, F, L, (Fi)i, (Pi)i,v) with root v such that [j Lq C T and L C Lq. 

(b) For every GSO-formula -&(x, y) (possibly with parameters), there exists a GSO-formula 
(f(x,y) (with parameters) such that, if $ defines a well-order on Lq and % is a priority 
tree as in (a) then (p(x, y) defines a linear order on T. 

Proof. 

(a) Let (ei)i <a be an enumeration of Lq. For every i < a, we fix a hyperpath m = h l . . . h l m . 
connecting v with h % m . = e«. We construct % by induction on i. We start with the hyper- 
path ttq. At step i > we determine the shortest suffix h\ . . . h l m . of the path m that meets 
the tree constructed so far and we add this suffix to the tree. (If e« C T we leave the tree 
unchanged.) We choose the least index k with hi PlPfc = and we put the new edges into F k 
and the new vertices into P k . The limit of this construction is the desired priority tree. 

(b) The equivalence relation ~ associated with % is MSO-definable in S) with the help of 
the parameters T, F, F{, and Pj. We denote the ~-class of a vertex u by [u]. Note that, by 
construction of X, [u] is a hyperpath and [u] contains a unique leaf edge which, furthermore, 
is one of the ends of the hyperpath. We denote by rj(u) the suffix of the hyperpath [u] that 
connects u to the leaf edge in [u]. 

To define the desired order on T we first construct a preorder on T by setting x Q y if 
and only if one of the following conditions is satisfied : 

• x G Pi and y S P^, for i < k. 

• x,y £ Pk and the leaf edge in [x] is -^-smaller than the leaf edge in [y]. 

• x,y G P k , [x] = [y], and rj(x) C rj(y). 

Note that we have x Q y and y C x if and only if r](x) = rj(y). In this case x and y belong to 
the same edge e € F. Hence, every C-class has size at most m. Adding m additional unary 
predicates Qq, . . . , Q m -i such that each Qi contains at most one element of each C-class, 
we can define 

x < y : iff x C y and (either y%x, or we have 

x G Qi and y G Qt , for i < k) . □ 

We have seen that every fc-sparse graph has an MSO-definable orientation / that is bounded 
by k. If we want to encode sets of edges via sets of vertices we can try to encode each 
edge e by a pair (v,i) consisting of the vertex v := /(e) and a number i < k. This idea 
requires a way to linearly order the sets f _1 (v). In [2] Courcelle uses depth-first spanning 
trees to obtain such linear orders. As remarked above one needs to adapt the definition of a 
depth-first spanning tree when one tries to extend these results to uncountable hypergraphs. 
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Definition 2.7. Let S) = (V, E) be a hypergraph of rank m and suppose that 1 = 
(T, <, (F v ) v€ t) is a structure where (T, <} is a tree (of ordinal height) with T C V and 
with every vertex u <G T we associate a set F v Q E of edges. We assume that F u PI F„ = 0, 
for ti/j). 

(a) The set of auxiliary nodes associated to a vertex -y G T is 

A, == WU[J^\ U A *- 

(b) For X CV,we define 

B(X/T) := {v £ T | In A ^ 0} and 0(X/T) := maxB(X/T) . 

(c) T is a depth-first spanning tree of f) if it satisfies the following conditions : 

• For all u ^= v, A u n A v = and 4nT = {u}. 

• For each edge e£fi the set B(e/T) is nonempty and linearly ordered by <. 

• The vertices v £ T are partitioned into the following classes: (0) the root; (si)i <m a 
successor; (ti)i <m a limit; where the successor and limit vertices are subdivided into m 
subclasses. This partition satisfies the following conditions: 

(0) If v has type then it is the root of T and F v = 0. 

(si) If v has type si then it is the (immediate) successor of some vertex u £ T. We have 
F v = {e} with v G e. Futhermore, v is the only vertex in B(e/T) of type s/ and 
P{e\{v}/T)=u. 

(ti) If v has type t\ then it is the limit of an increasing sequence (uj)j <7 of vertices U{ £ T. 
F„ is (the set of edges of) a priority tree with root v. Furthermore, 

{P(e/T) | e a leaf edge of F„ } 

is a cofinal subset of (uj)j and u is the only vertex in B(\J F v /T) with type t[. 

Proposition 2.8. Every connected hypergraph S) has a depth-first spanning tree. 

Proof. If in the definition of a depth- first spanning tree we drop the condition that B(e/T) ^ 
0, for every edge e, then we obtain a structure that we call a partial depth-first spanning 
tree. We construct an increasing sequence 

(T a , <, (F v ) veTa ) , a<n, 

of such partial depth-first spanning trees with the property that, for every connected com- 
ponent C of U a := V \ \J veT A v , the set 

N(C/T a ) := \J { B(e/T a ) | e G E with e n C + } 

is linearly ordered by <. (A connected component of U a is a maximal subset C C U a such 
that the subhypergraph S)\c is connected.) The limit of this sequence will be the desired 
depth-first spanning tree of 9). 

We start by choosing an arbitrary element v G V and setting To := {v} and i 7 ^ := 0. 
For limit ordinals <5, we define Tg := \J a< gT a . For the successor step, suppose that we 
have already defined T a . Fix some connected component C of U a . Note that N(C/T a ) is 
nonempty since S) is connected. We distinguish two cases. 

(1) If N(C/T a ) has a maximal element u then we choose some edge e with eC\ A u ^ $ and 
e Pi C ^ 0, and we fix some vertex d £ e fl C. We add v to T Q as immediate successor 
of u and we set F v := {e}. It follows that A v = e fl Since B(e/T a ) contains at most 
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l e \ i v }\ < m vertices there is some I < m such that B(e/T a ) contains no vertex of 
type S[. Hence, in the new tree T a+ \ we can assign the type s; to v. 
(2) Suppose that N(C/T a ) has no maximal element. We choose a sequence (ej)j< 7 of edges 
with e, fl C 7^ such that the sequence (uj)j< 7 defined by 

Ui := f3(ei/T a ) 

is increasing and cofinal in N{C /T a ). By taking a suitable subsequence we may assume 
that the set of types appearing in B(ei/T a ) is the same for every % < 7. 

For each edge ej, choose some edge hi C C with hi Dei 7^ and set L := {hi \ i < 7 }. 
We select a vertex v £ C and a priority tree S = (5, if, L, F, P, u) such that S C. C. We 
define T a+ i := T^U-ft; } where u is the limit of N(C/T a ) and we set _F„ := £fU{ &i j i < 7 }. 
It follows that A v = S U Uife n U a ). 
It remains to show that the constructed tree T a+ i is a partial depth-first tree where all sets 
N(C/T a+ \) are linearly ordered. We start by showing that each set B{e/T a+ \) with e S -E 
is linearly ordered. If e fl A v = then B(e/T a+ i) = B(e/T a ) and we are done. Otherwise, 
we have B(e/T a+ \) = B(e/T a+ i) U {v}. Note that A v C C implies eflC 7^ 0. Therefore, 
we have B(e/T a ) C N(C/T a ). Since u is larger than every element in N(C/T a ) the claim 
follows. 

Let D be a connected component of ?7 a +i := V \ UajeT^+i We have to show that 
N(D /T a+ i) is linearly ordered. Since U a +\ C J7 Q there is some connected component I?' 
of U a containing D. If D' 7^ C then C7 Q \ C/q+i C C implies that D = D' and the set 

N{D/T a+l ) = N(D'/T a ) 

is linearly ordered. If, on the other hand, DCC then we have 

N(D/T a+1 ) C iV(C7/T Q ) U M 

and the latter set is linearly ordered since v is greater than every element of N(C/T a ). □ 

Remark 2.9. 

(a) If the hypergraph fj is countable then we can actually obtain a depth-first spanning tree 
of height at most oj as follows. In the above proof, if we are slightly more careful in choosing 
the vertex v that is added to the partial tree, then we can ensure that every vertex is chosen 
already after finitely many steps. 

(b) Note that, strictly speaking, the above proposition is not a generalisation of Theorem 1.4 
of [2j since we use a different notion of a depth-first spanning tree. 

We use depth-first spanning trees to encode orientations of a hypergraph. First, we show 
that each depth-first spanning tree can be encoded by finitely many GSO-parameters. 

Lemma 2.10. For every m < uj we can construct MSO -formulae ip(X;Z), $(x, Y;Z), and 
x(x, y; Z) such that, for every connected hypergraph Sj of rank at most m and each depth-first 
spanning tree (T, <, (F v ) v ) of $), there are GSO-parameters S such that 

fj |= (p(P; S) iff P CT is downward <-closed, 

fj^$(v,P;S) iff veTandP = A v , 

$j \= xi u i v i S) iff u,v G T and u < v . 

Proof. We will use the following parameters: 

• Unary predicates To, T Sl , T tp for I < m, containing all vertices of the corresponding type. 
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• F T := |J { F v I v G T T }, for every type r. 

• F:= (J r P T - 

• A T := |J { A v \ v E T T } \ T, for every type r. 

• If v is of type t\ then the set F v forms a priority tree. We use additional parameters F* 1 
and Pi, for i < m, encoding the corresponding partition 

F f > = P*< U---UF^_ 1 and [j P*< = P U ■ ■ ■ U P m _! . 

Hence, for every vertex t> of type we have a priority tree 

(U f v ,f v ,l v , (if n F^i, n U i^, «) . 

(1) First, we construct the formula To simplify our task we define separate formulae 
•& T {x, Y), for each type r, such that 

ft \= T ( W) P) iff „ e T T and P = A, . 

Then we can set 1? := V T $t- 

If v has type then A v = {v} and we can set 

#o(x,Y) := T xAY = {x} . 

If the type of v is s/ then F v = {e} and A v = {v} U(ef1 ^4 S ')) where e is the unique edge in 
F Sl containing v. Hence, we can define 

$ Sl {x,Y) := T Sl x A (Be G F Sl )[x G e A y = {x} U (e n A Si )] • 

Finally, if -y has type ti then _A„ is the least subset of {v} U A tl satisfying the following 
conditions : 

• v g y 

• If e G F*< and e n y / then e n A*' C Y. 
Hence, we can define 

t ,(x,y) := T t; x A y a; A (Ve G F*')(eny + -> e n A tl C y) 

A VZ[Zx A(Ve6F i, )(enZ/Neni' i a)^ya]. 

(2) Next, we define a formula a(x, y) such that 

h a(u, Q) iff v G T and Q = B({J F v /T) . 
The formula a(x, Y) should state that 

y = { u G T | there is some e G F„ with eV\A x ^§ and e fl A u ^ } . 
Using the formulae i? r we can write a as 

a(x, Y) := \J[T T x A (Be G F T )(e n ^ / A e n A u + 0)] . 

T 

(3) With the help of a we can write down the desired formulae (p and %. 

<^(X) := VxVy [Xi A a(x, Y) -> y C X] , 
X (x, y) := VX[^(X) -> (Xy -> Ix)] . □ 
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Lemma 2.11. We can construct GSO -formulae ip m (x , y; Z) , for m < uj, such that, for 
every depth-first spanning tree (T, <, (F v ) v ) of a connected hypergraph S) of rank m, there 
are GSO -parameters S such that the formula ip m (x, y; S) defines a preorder C with the 
following properties: 

• The restriction of C to T coincides with <• 

• C linearly preorderes every set X C V such that B{X/T) is linearly ordered by <. 

• Each \Z -class has at most m elements. 

Proof. Let x{ x iV) an d $(x,Y) be the formulae of Lemma 12.101 For each type r, we define 
a formula r/ T (x,y) linearly preordering each set A v where v is of type r. Then the desired 
formula ip m states that either 

(1) x £ A u and y E A v for u < v, or 

(2) 1,1/ £ A v , for some v of type r, and r] T (x,y) holds. 

If v is of type or S[ then A v contains at most m elements and we can set 

rj T (x, y) := true . 

For vertices of type ti we can use the formula from Lemma [2.61 □ 

Corollary 2.12. We CCLTi COTbStviLCt GSO -foTTilllldC 1pm (x,y;Z), for m < uj, such that, for 
every depth-first spanning tree (T, <, (F v ) v ) of a connected hypergraph Sj of rank m, there 
are GSO -parameters S such that the formula ip m {x,y; S) defines a partial order C with the 
following properties: 

• The restriction of C to T coincides with <. 

• C linearly orderes every set X C V such that B(X/T) is linearly ordered by <. 

Proof. Let C be the preorder from Lemma 12.111 Since every C -class contains at most m 
elements we can add 777- new unary predicates Pq^ • • • •> Pm—i 

such that P U • • • U P m -\ = V 
and we have \X D P{\ < 1, for each C -class X and all i. Then we can define 

u C v : iff either u\Zq v , or 

u Eq v , v Qo u , u € Pi , v € Pfc for i < k . □ 

Theorem 2.13. We can construct GSO-formulae ip m (x,Y; Z), form < to, such that for ev- 
ery hypergraph 9) of rank m, there are GSO -parameters S such that, the formula (fi m (x, Y; S) 
defines an orientation of f). 

Proof. Suppose that fj has k connected components Cj, i < K. For each component Cj we 
fix a depth-first spanning tree (T l , < % , (F%) v ). Let S l be the parameters from Lemma [2.101 
and Lemma 12.111 For every edge e G E, there exists a unique component such that the 
intersection X := efllj^ A l v is finite and nonempty. Furthermore, the set B(X/T l ) is linearly 
ordered by <. Using the ordering C of Corollary 1 2 . 1 2 1 we can write down a formula ip m (v, e) 
stating that v is the C-least element of this set X. □ 

Corollary 2.14. Every hypergraph of rank m < uj is GSO -orientable. 

Let us mention the following consequences of this result. For countable hypergraphs 
they are again due to Courcelle [2]. 

Definition 2.15. 

(a) A formula (f(x, y, Z) defines an edge ordering of a hypergraph S) = {V, E) if, for every 
edge e € E, the formula <p(x, y, e) defines a linear ordering on the vertices of e. 
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(b) A formula (p(x, y, z) defines an neighbourhood ordering of a directed graph & = (V, E) 
if, for every vertex v E V, the formula <p(x, y, v) defines a linear ordering on the set 
{u G V | (u,v) 6 E}. 

Lemma 2.16. There exist GSO -formulae (p m (x,y, Z;U), for m < u, such that, for ev- 
ery hypergraph Sj = (V, E) of rank m, there are GSO -parameters S such that the formula 
tym{x > V > Z ; S) defines an edge ordering off). 

Lemma 2.17. There exist MSO-formulae (p m (x, y, z; U), for m < u, such that, for every 
directed graph <5 of indegree at most m, there are MSO -parameters P such that the formula 
<Pm{x,y,z\P) defines a neighbourhood ordering of (5. 

Proof. We can apply Lemma 12.161 to the hypergraph fj := (V,F) where 

F := {!(«) | v 6 V} with I{v) := { u G V | (ti, v) 6 E } . 

Note that every subset S C can be encoded by the set 

J _1 (F) := {v G y | G F} C y . 

Hence, every GSO-formula over Sj can be translated into an MSO-formula over <S. □ 

3. GSO VERSUS MSO 

In [2] Courcelle has shown that we can translate every GSO-formula ip into an MSO-formula tp 
that is equivalent to ip on all countable fc-sparse hypergraphs. Using the results of the 
previous sections we can lift the restriction to countable hypergraphs. The proof in [2] goes 
through unchanged since it relies only on the statements of Lemma 12.161 and Lemma I2.17[ 
and on local modifications of hypergraphs. 

Theorem 3.1. For all numbers m, k < u, there exists a monadic second-order interpreta- 
tion (with monadic parameters) that maps a k-sparse hypergraph of rank m to its incidence 
structure. 

Corollary 3.2. For all m, k < cu and all formulae (p(x,Y, Z) G GSO with first-order vari- 
ables x, monadic variables Y , and guarded second-order variables Z , there exists a formula 
ifj(x,Y, Z) G MSO with the following property: for all k-sparse hypergraphs 9) = (V,E) of 
rank m and all parameters a% G V , Pi C V , Ri C E, there exist parameters Qi C V such 
that 

ft\=tp(a,P,R) iff $j^i;(a,P,Q). 

4. Sparse distributions 

The results so far concern ways to encode edges by vertices. In this last section we consider 
a more general problem. Let <5 = (V,E) be a graph. We denote by &fin(V) the set of all 
finite subsets of V. We would like to encode a given subset F C 3^^ n {y) by a set of vertices, 
that is, we would like to find a definable function h : F — > V that is injective. For F = E 
this reduces to the problem considered in the preceding sections. For arbitrary F, such a 
function h does not always exist. But we will show that sometimes we can transform a given 
function ho : F — > V into an injective one. 
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These results are inspired by work of Colcombet and Loding [T] on set interpretations. 
Colcombet and Loding consider a power set operation V on structures. One of their main 
results in a commutation theorem for interpretations and the power set operation. They show 
that, given a tree T and an FO-interpretation X such that X{V{%)) is of the form V (£Dt), for 
some structure 9Jt, then there exists a WMSO-interpretation J such that = i7(T). On 
ingredient in the proof of this result is a method to encode, in a definable way, finite subsets 
of the tree T by single vertices. 

Suppose we are given a function ho : F —?■ V that we want to transform into an injective 
function h : F — > V . Let 5(v) := The first step in the construction of h consists 

in finding a definable function g : V — > V such that |<7 _1 (?;)| = S(v), for all v. Of course, 
this is not always possible. For instance, if the graph is finite and we have 8(v) > 1, for 
all vertices v. Therefore, we consider only functions 5 that are sparse in the sense of the 
following definition. 

Definition 4.1. Let & = (V,E) be an undirected graph. 

(a) The border of a subset Z C V is the set 

B & (Z) :=En(V\Z)x Z 

of all edges connecting a vertex in Z with a vertex outside of Z. 

(b) A distribution of © is a map S : V — > U). For X C V, we define the shorthand 

5(X) := £ 6(x) . 
vex 

(c) Let h : X — > V be an arbitrary mapping. The distribution induced by h is the function 
5 : V — > uj with 

S( v ) := \h- l (v)\ . 

(d) A distribution 5 is k-sparse if 

5(Z) <\Z\+k ■ \B<${Z)\ , for every Z C V . 

Given a /c-sparse distribution 5 we will construct the desired function g : V — > V by 
solving a network flow problem. 

Definition 4.2. Let © = (V,E) be an undirected graph. 

(a) A flow of © is a function / : V x V — > Z such that, for all it, u € V, 

• f(u,v) = -f{v,u) and 

• f(u,v) / implies (it, u) G £7. 

(b) A flow / is acyclic if there is no cycle «o> • • • ,Um of © such that /(u m ,«o) > and 
f(ui,Ui + i) > 0, for all i < m. 

(c) The defect of a flow / is the distribution 

:= X] /( v ' n ) • 

(d) A flow / is a 5-flow if, for every v £ V, either 

df(v) = 5(v) — 1 or 6(v) = and df(v) = . 
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(e) A flow / is edge-bounded by k if \f(u,v)\ < k, for all u,v G V. We call / vertex-bounded 
by k if 



^2\f(u,v)\ <k, foralluey. 



Our aim is to show that, for every fc-sparse distribution 5 there is a bounded <5-flow / and a 
function g : V — > V inducing 5. Furthermore, if 5 is definable then g should also be definable. 

Definition 4.3. Let L be a logic. 

(a) A distribution 5 is L-definable if there exist formulae (fi(x) € L, i < k, such that 

8\=<Pi(v) iff *(«)=». 

(b) Similarly, a flow / is L-definable if there exist formulae ipi(x,y) € L such that 

© (= ifi(u,v) iff /(it, v) = « • 

Remark 4.4. Note that every edge-bounded flow can be encoded with the help of the 
GSO-parameters 

Si := { (u, v) € E | /(it, v) = i } . 

For trees the problem of encoding sets by vertices has been solved by Colcombet and 
Loding [lj. In the general case proved below the function g is only definable with the help 
of GSO-parameters, but for trees we can do without them. 

Theorem 4.5 (Colcombet and Loding [T]). Let % = (T,E) be an infinite directed tree and 
5 a WMSO-definable k-sparse distribution of %. There exists a WMSO- definable flow f that 
is edge-bounded by 7k and satisfies df(v) > 5(v) — 1, for all v. 

Theorem 4.6 (Colcombet and Loding |Tj). Let 1 = (T, E) be a directed tree and 5 a WMSO- 
definable k-sparse distribution of T such that 5(T) < \T\. There exists WMSO-definable 
function g : T — >■ T such that 5 is the distribution induced by g. 

To prove our generalisation of these results we start with a few lemmas about bounded 
flows. The first two follow immediately from the definitions. 

Lemma 4.7. Every flow that is vertex-bounded by k is also edge-bounded by k. 

Lemma 4.8. Suppose that & is a graph with maximal degree d. Every flow of (5 that is 
edge-bounded by k is vertex-bounded by dk. 

Lemma 4.9. For every 8-flow f there exists an acyclic 5-flow f such that, if f is edge- 
bounded by k or vertex-bounded by k then so is f. 

Proof. We repeat the following construction until the flow is acyclic. Select a cycle ito, • • • , u m 
such that 

c := min { f(v,i, Mj+i) | i < m } > . 

We define /' by 

f(x, y) — c if x = Ui and y = n^+i , for some i , 
f'(x, y) := < f(x, y) + c if x = Uj_)_i and y = m , for some i , 

f(x, y) otherwise . □ 
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Proposition 4.10. Let & = {V, E) be an undirected graph and 8 a k-sparse distribution. 
Then <$ has a 5 -flow f that is edge-bounded by k. 

Proof. First, we assume that <5 is finite. In this case we can reduce the task to a network 
flow problem. Let Sj be the graph obtained from & by adding two new vertices s and t that 
are connected to every vertex of (5. We define the capacity c(e) of edges e of Sj as follows. 
For edges e of © we set c(e) := k. If e = (s,v) with v <G V we set c(e) := max{0, 5(v) — 1}. 
Finally, if e = (v, t) with v G V we define 



c(e) := 



if(5(u)>0, 

1 otherwise . 

Let / be a maximal flow from s to t with respect to c. We claim that its restriction to the 
edges of (5 is the desired flow. 

According to the Max-Flow Min-Cut Theorem, there is a set X of vertices containing s 
but not t such that the maximal flow m from s to t equals 

m = c ( e ) ■ 

eeB^X) 

Let X := X \ {s} C V and Y := <S _1 (0). Since 

BflOY) = B <5 (X )U{(v,t) \ v e X }U{(s,v) \veV\X }, 

we have 

m = c ( e ) 

eeB„(X) 

= k ■ \B e (x )\ + \x n y\ + <5(y \ x ) - |(y \ x ) \ y\ 

= k ■ \B e (X )\ + \X \ + 5(V \X )-\(V\ X ) \ Y\ - \X \ Y\ 
> 5(X ) + 5(V\X ) -\V\Y\ 
= 5{V) -\V\Y\. 
On the other hand, for the set X = {s}, we have 

m< Yl c{e) = ^2max{Q, 6{v) -1} = 6(V) -\V\Y\. 

e€Bf,(X) v€V 
Consequently, the maximal flow m from s to t equals 

m = S(V) - \V\Y\. 

This implies that 

f(s, v) = max{0, S(v) — 1} , for every v G V . 
For each v £ V, we therefore have 

0= f(u,v) = ma X {0,S(v)-l} + f(t,v) + ^2f(u,v). 

uevu{s,t} usv 

If 5(v) > this implies 

6{v) - 1 - Y f( v > u ) = ' that is d f (v) = 5(v)-l, 
uev 
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while, for 5(v) = 0, we have 

-f(v,t)-^f(v,u) = 0. 

uev 

Hence, either df(v) = — 1 = 5(v) — 1 or df(v) = 0. 

It remains to prove the lemma for infinite graphs. Let ^(0) consist of the elementary 
diagram of <3 together with first-order formulae stating that / is a 5-flow on (5 that is 
edge-bounded by k. We will use the compactness theorem to show that <P((5) is satisfiable. 

Let (Pq C $(<5) be finite. There exists a finite induced subgraph ©o = (Vb, Eq) of (5 such 
that Q 5?((8o)- Let (uo,t>o), • • • , (« m -i, u m -i) De an enumeration (without repetitions) of 
all edges (u,v) with u G Vb and u G V \ Vb. We construct a new graph O = (Vq,E' ) by 
attaching to each vertex Ui a path Pj of length /c. Let 5' be the distribution on &' with 
c)'(v) = <5(v), for v G Vb, and S'(v) = 0, for f G Vg \Vb- In order to show that is satisfiable 
it is sufficient to prove that &' has a flow of the desired form. Consider an arbitrary set 
X C Vq of vertices. Let 

/ := { i | itj G X } and J := { i \ m G X and Pj C X } . 

It follows that 

S'(x) = 5(x n Vb) < I A" n Vbl + fc • n v )\ 

< \x\-k-\j\ + k-\B % (xnv )\ 

<\X\-k-\J\ + k- {\B &o (X)\ + I J|) = |X| + k ■ \B % {X)\ . 

By the first part of the proof it follows that <5' has a flow of the desired form. □ 

It remains to show how we can use the <5-flow / we have just constructed to define the 
desired function g : V — > V. We start by selecting a certain family of definable paths. Note 
that we allow paths of length 0. Such paths are uniquely determined by the vertex they start 
(and end) at. 

Lemma 4.11. Let (5 be a countable undirected graph and f an acyclic 5-flow of (5. There 
exists a set V of finite paths through & satisfying the following conditions: 

(i) For every v G V , there are exactly 5(v) paths in V starting at v. 

(ii) For every v G V there is at most one path in V ending at v. 

(iii) For every pair u, v G V of vertices there are at most f(u,v) paths in V containing the 
edge (u,v) (in this direction). 

Proof. Fix an enumeration (v n , k n ) n<LU of the set 

{ (v, k) | v G V, 0<k< 5(v) } . 
For n < oj, we construct paths ir n with the following properties: 

• TT n starts at v n . 

• If m 7^ n then the endpoints of ir m and ir n are different. 

• For every edge (u,v) there are at most f(u,v) paths 7T„ containing the edge (u,v). 
By induction, suppose that we have already defined 7Tj, for i < n. Let 

(1) a(y) be the number of paths 7Tj, i < n, starting at v, 

(2) f3(v) the number of paths 7Tj, i < n, ending at v, and 

(3) fJ,(u,v) the number of paths 7Tj, i < n, containing the edge (u,v). 
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We construct a path uq ■ ■ ■ u m inductively starting with no := v n . For the induction step, 
suppose that we have already defined no, • • • , nj. If /3(iii) = then we stop and set ir n : = 
uq . . . U{. Otherwise, we claim that there is some neighbour w of Ui with f(v,i, w) > jU(nj, w). 
Hence, we can set n^+i := w. 

To prove the claim, we distinguish two cases. If i = then a(uo) < 6(uq) implies that 

< a(u ) - 1 + { ffa n ) | f(x, n ) > } 

= a(n ) - 1 + { /( n o, x) | /(n , x) > } - (J(n ) - 1) 

< ^{/K,z) I f(uo,x) > 0}, 

as desired. Similarly, if i > then u(nj_i,nj) < f(iii-i,Ui) implies that 
^ Kui,x) = a(ui) - f3{ui) + ^ nj) 

< a(nj) - 1 + Y { Z( x ' u i) I ^j) > } 

= a( Ui ) -l + Yi f( u ^ x ) I f( u ^ *) > } - ( S ^i) ~ 1) 

<^{/(n,,x) |/(n,,x)>0}. 

Note that the construction of 7r n must terminate after at most n + 1 steps since the flow / 
is acyclic and there are only n vertices n with /3(n) = 1. □ 

Lemma 4.12. There exist GSO-formulae (p m (X; Z), form < uj, such that, for every graph (J5 
and each set V of finite paths such that every vertex and every edge of (J5 is contained in at 
most m paths ofV, there exists a tuple S of GSO -parameters such that 

® \= L Pm{P] S) iff P is (the set of edges of) a nonempty path in V . 

Proof. For every edge (n, v) of & we fix a bijection n(n, v) : [n] — > V e where V e C "P is the 
set of all paths containing the edge (u,v) (in either direction) and n := \P e \- We assume 
that u(n,i>) = fj,(v,u). 

Let 5 1 be the set of all edges of (5 contained in some path in V. By Lemma 12.171 there 
exists an MSO-formula x{ x i Vi z 'i with parameters S' such that, for every v € V, the 
formula x{ x i Vi v \ linearly orders the set of all vertices that are connected to v via an 
edge in S. 

Finally, we define unary predicates Q l £ containing all vertices v such that there exists 
a path 7r € V containing edges (n, v), (v,w) where 

• fi(u,v)(k) = 7r, fj,(v,w)(l) = 7T, 

• n is the i-th neighbour of v (in the order defined by x)i 

• w is the j-th neighbour of v. 

It follows that a nonempty set P C E of edges is a path in if and only if P is a minimal 
nonempty subset of E satisfying the following condition: 

P can be written as a union P = Pq U • • • U P m -l such that, for all vertices n, v, w such 
that v € Q^r and n and u; are, respectively, the i-th and j-th neighbour of u, we have 
(u,v) eP^ (v,u>) E P^. 
This condition can be expressed in GSO. □ 
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Remark 4.13. Note that the set of empty paths in V is trivially definable with the help of 
the parameter 

Q := { v € V | V contains an empty path from v to v } . 

Using the family V we can construct a formula 93 defining the function g. 

Proposition 4.14. There exist GSO-formulae ip m (x,y; Z), for m < u, with the following 
property: for every graph & = (V,E) and each acyclic 5-flow f of & that is vertex-bounded 
by m, there exist GSO -parameters S such that ip m (x,y; S) defines on & a partial function 
g : V — > V with 

\ g - l ( v )\ = S(v) , forallveV. 

Proof. Let (&' be the graph obtained from <3 by removing every edge (u, v) with f(u, v) = 0. 
Note that / is also a 5-flow of (5'. Since / is vertex-bounded by m it follows that every 
vertex of & has degree at most m < to. Consequently, each connected component <3q of (5 
is countable. Let Vq be the set of paths obtained by applying Lemma [4, 111 to the restriction 
of / to ®o> an d let V be the union of all these sets Vq corresponding to the connected 
components of & '. By Lemma 14,121 there exists a formula tp(X; Z) and a set S of guarded 
relations such that 

(5 \= ij){P\ S) iff P is a nonempty path in V . 
With the help of i/j we can define a partial function g : V — > V such that 

g(v) = u : iff V contains a path from u to v . 

By construction of V we have = 5(v), for every v £ V. □ 

Lemma 4.15. Let & = (V,E) be a graph of finite degree and ip(X,y) a GSO-formula that 
defines a partial function h : &{V) — > V such that the distribution 5 induced by h is k- 
sparse. Suppose that there exists a GSO-formula x(X, Y, z) such that, for every vertex v € V, 
x(X,Y,v) linearly orders the set Then there exist MSO -definable partial functions 

ho : &(V) —> V and g : V — > V such that h = g o /i and ho is infective. 

Proof. By Proposition 14.101 there exists a 5-flow / that is edge-bounded by k. Since (3 has 
finite degree it follows that / is vertex-bounded by some constant m < uj. Hence, we can 
use Proposition 14, 14l to find a definable function g : V — > V with = S(v) = |/i -1 (i>)|. 

Choose unary predicates Pq, . . . , Pk-i such that we have i 7^ I whenever u € Pi and v € Pi 
are distinct vertices with g(u) = g(v). Using these predicate we can define partial functions 
90: • • • > 9k— 1 '■ V — > V such that gi(v) is the unique element of n Pj. We define 

/io : &(V) — > V by ho(X) := (gi o h)(X) where the index i is chosen such that X is the 
i-th element of h~ l {h{X)) (in the order defined by \). It follows that h(X) = g(ho(X)) and 
ho is injective. Furthermore, the function ho is clearly GSO-definable. Since the graph (5 
has degree at most k it is /c-sparse. Hence, every GSO-definable function is already MSO- 
definable. □ 

Recall that ^P^ n {V) denotes the set of all finite subsets of V. Combining the preceding 
lemmas we obtain the main result of this section. 

Theorem 4.16. Let <3 = (V,E) be a graph of finite degree and ip(X,y) a GSO-formula 
that defines a partial function h : &ft n (V) — > V such that the distribution 5 induced by h is 
k-sparse. Then there exist MSO-definable partial functions ho : <^fi n (U) — > V and g : V — > V 
such that h = g o ho and ho is injective. 
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Proof. By the preceding lemma it is sufficient to construct a formua x(^> Y, z) (with GSO- 
parameters) such that x(X, Y,v) linearly orders h~ l (v), for every v € V. Let To C E be 
a spanning forest of <3 and let P C V be a set containing exactly one element of each 
connected component. Using the parameters P and To we can define the tree ordering on V 
by 

u < v : iff the unique path in To from some element of P to v contains u . 

Let T C V x V be the set obtained from To by orienting the edges according to this 
ordering. Then T is a directed forest. Furthermore, since the degree of (5 is bounded we 
can use Lemma 12.171 to linearly order the successors of every vertex in T. We use these 
two orderings to define the lexicographic ordering <i ex on T. Finally, we obtain the desired 
ordering on <^fi n (V) by setting 

X < Y : iff the <i ex -minimal element of (X \ Y) U (Y \ X) belongs to Y . 

Each of these definitions can be expressed in GSO. □ 

5. Conclusion 

We have presented several methods to encode sets of finite vertices as single vertices. In the 
first part, we used depth-first spanning trees to encode edges by vertices. As an application we 
were able to extend Courcelle's result on the collapse of GSO to MSO on sparse hypergraphs 
from countable hypergraphs to hypergraphs of arbitrary cardinality. In the second part we 
used network flows to encode arbitrary finite sets by vertices. 

Let us mention some open questions. Considering the first part it would be interesting 
to find out whether sparse classes are the only examples where GSO collapses to MSO. 

Problem 5.1. Is there a class C that is not /c-sparse, for any k, such that over C every 
GSO-sentence is equivalent to an MSO-sentence? 

The results of the second part are much less complete. It is unlikely that they are the 
best possible. 

Problem 5.2. Improve Theorem 14 . 1 6 1 by allowing 

(a) more general classes of graphs or hypergraphs ; 

(b) more general classes of partial functions h : ^(V) — > V. 

Our results were inspired by work of Colcombet and Loding [T]. The question arises of 
whether we can also generalise the remaining results of that article. 

Problem 5.3. Can we prove Corollary 4.4 of p,] for other graphs than trees? 
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